Gene Avin_40090 details


Gene Information

Locus tag: Avin_40090
Symbol:
ID: 7762896
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 4063178
End bp: 4065358
Gene Length: 2181 bp
Protein Length: 726 aa
Translation table: 11
GC content: 69%
IMG OID: 643806869
Product: outer membrane copper transport protein
Protein accession: YP_002801121
Protein GI: 226946048
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1629] Outer membrane receptor proteins, mostly Fe transport
TIGRFAM ID: [TIGR01778] TonB-dependent copper receptor
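
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon. Both are assumptions, not statements from the record, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(4063178, 4065358)
print(length)                     # 2181
print(protein_length_aa(length))  # 726
```

Under these assumptions the computed values agree with the Gene Length (2181 bp) and Protein Length (726 aa) fields.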


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACCGCG ACAAGTTTCG CCGTGCGGCG CGCCCGCACC TGCCCAAGAC CTTTTCCCGC 
CTCCTGCCGC ACTGGCGCCG GGCACCGGCG CATGCCGCGC TGCTCGGCCT ACTGGCCAGC
GGCAGCCTGC CGGCCGCCGA GCCGGAAGAC GACAGCGACT CCCTGCAAAT GTCGCCGCTG
GTGATCACCG GCGTGGCCCT GGATGCGCCG CTGACCGTGG TGACCGACCC GCGCATCCCC
CGCCAGCCGG TGCCGGCCAG CGACGGCGCC GACTACCTGA AGACCATTCC CGGCTTTTCC
GCGATCCGCA GCGGCGGGGT GAACGGCGAC CCGGTGTTCC GCGGCATGTT CGGCTCGCGG
CTCAAGCTGT TGACCAACGG CGGCGAGATG ATCGGCGCCT GCCCGGCGCG GATGGATTCG
CCCAGTTCCT ACATCGCGCC GGAAACCTTC GACGAACTGA CCGTCATCAA GGGCCCGCAG
AGTGTGCTGC ATGGCCCCGG CGCCTCGGCC GCCACCGTCC TCTTCGAACG CAACCCGGAA
CGTTTCGACG CGCCGGGCGG GCGTGTCGAC ATGAGCTTCC TGGCCGGCTC CAACGGTCGC
TTCGACCGCC GCATCGATGC TGCCGCCGGC GCCGAGCAGG GTTATTTCCG CCTGCTCGCC
AACCGCTCCG ACGCGGACGA CTACGAGGAT GGCGACGGCG ACACCGTGCC CTCGCGCTGG
GACAAGTGGA CCAGCGAATT CGTCGCCGGC TGGACGCCGA CCGCCGACAG CCTGCTGGAA
CTCACCTACG GCATCGGCGA CGGCGAGTCC CGCTACGCCG GGCGCGGCAT GGACGGCACC
CGGTTCGACC GCGAGAGCCT GGGGCTGCGC TTCGAGCTGG AGAACATCGG CGAGGTTTTC
CAGAAGGTCG AGGCGCGGCT CTACTACAAC TACGCCGACC ACGTGATGGA CAACTTCCGC
CTGCGCAGCC CGGATGCGGA TTCCAGCATG CCGATGCCGA TGGCCAGCAA CGTCGACCGC
CGCACCCTGG GCGGACGCCT GACCGGCACC TGGCGCTGGG AGCACTACCA ACTGGTCGCC
GGCCTCGACT ACCAGACCAG CGAGCACCGC GAGCGCTCCT CCACCTACAC GAGTGCGAGC
CACGGCATGT CCATGGGTCA TCACGTCATG ACCAGCGCCT CGTTCGTCGA CGCCGACGAC
TATCCCTGGA ACAAGGACGC CGATTTCCAC AACTTCGGCC TGTTCGGCGA ACTGACCCGG
CACCTGGGCG ACGCCTCGCG GCTCGTCCTC GGCGCGCGGC TGGATCGCAG TTACGCCGAG
GATCACCGCG ATACGCTGAG CGGCAGCATG GGCATGTCGA GCTGGAACAA TCCCACCGCC
GGCAAGGAGC GCGGCGACAC CCTGCCCAGC GGCTTCGTCC GCTACGAACA CGGCCTCGCC
GGCATTCCGG CGACGGCCTA TGTCGGCCTG GGGCACACCC AGCGTTTCCC GGACTACTGG
GAACTGTTTT CCTCCACCAA CAGCATGGGC CGGACCAGCG CCTTCGATTC CATCCACCCG
GAGAAGACCA CCCAGCTCGA CTTCGGCATC CAGTACGCGC GGGGGCCGCT GGAAGCCTGG
GCCTCCGGCT ACCTCGGCTG GGTGCGGGAT TTCATCCTGT TCGACTACGC CAGCGGCATG
TCGGGCATGG GTTCGTCCTC GGCGCGCAAC ATCGACGCAC GCATCTTCGG CGGCGAGGCG
GGTCTGTCCT ACCGCTTGAG CCAGCATTGG AAGACCGACG CCAGCCTGGC CTACGCCTGG
GGCAAGAACA GCTCGGACGG CCGCGCGCTG CCGCAGATCC CGCCGCTGGA AGGGCGTTTC
AGCCTGACCT ACGAGCAGGG CGACTGGAGC GCCTCCGGCC TGTGGCGGCT GGTGGCCAGG
CAGACGCGGG TGGCGGAAGG CCAGGGCAAC GTGGTGGGGC AGGATTTCGG CAAGAGCGCC
GGCTTCGGCG TGCTCTCCTT CAACGGCGCG CACCGCTTCA ACCGGCACCT CAAGCTCAGC
GCCGGCGTCG ACAACCTGCT GGACAAGCGC TACAGCGAAC ACCTCAACCT GGCGGGCAAC
GCCGGCTTCG ACTATCCGGG CGACACGCGG ATCAACGAGC CGGGGCGGAC CTTGTGGGCG
CGGGTGGACC TGAGCTTCTG A
 
Protein sequence
MNRDKFRRAA RPHLPKTFSR LLPHWRRAPA HAALLGLLAS GSLPAAEPED DSDSLQMSPL 
VITGVALDAP LTVVTDPRIP RQPVPASDGA DYLKTIPGFS AIRSGGVNGD PVFRGMFGSR
LKLLTNGGEM IGACPARMDS PSSYIAPETF DELTVIKGPQ SVLHGPGASA ATVLFERNPE
RFDAPGGRVD MSFLAGSNGR FDRRIDAAAG AEQGYFRLLA NRSDADDYED GDGDTVPSRW
DKWTSEFVAG WTPTADSLLE LTYGIGDGES RYAGRGMDGT RFDRESLGLR FELENIGEVF
QKVEARLYYN YADHVMDNFR LRSPDADSSM PMPMASNVDR RTLGGRLTGT WRWEHYQLVA
GLDYQTSEHR ERSSTYTSAS HGMSMGHHVM TSASFVDADD YPWNKDADFH NFGLFGELTR
HLGDASRLVL GARLDRSYAE DHRDTLSGSM GMSSWNNPTA GKERGDTLPS GFVRYEHGLA
GIPATAYVGL GHTQRFPDYW ELFSSTNSMG RTSAFDSIHP EKTTQLDFGI QYARGPLEAW
ASGYLGWVRD FILFDYASGM SGMGSSSARN IDARIFGGEA GLSYRLSQHW KTDASLAYAW
GKNSSDGRAL PQIPPLEGRF SLTYEQGDWS ASGLWRLVAR QTRVAEGQGN VVGQDFGKSA
GFGVLSFNGA HRFNRHLKLS AGVDNLLDKR YSEHLNLAGN AGFDYPGDTR INEPGRTLWA
RVDLSF
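
The sequences above are printed in space-separated blocks of ten characters, so the whitespace has to be stripped before any analysis. The following is a minimal sketch of that cleanup and of a GC-content calculation of the kind summarized in the GC content field. The helper names are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the spaces and line breaks used to group a sequence into blocks."""
    return "".join(grouped.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied as displayed (six blocks of ten bases).
first_line = "ATGAACCGCG ACAAGTTTCG CCGTGCGGCG CGCCCGCACC TGCCCAAGAC CTTTTCCCGC"
seq = clean_sequence(first_line)
print(len(seq))                   # number of bases after cleanup
print(round(gc_percent(seq), 1))  # GC percentage of this fragment only
```

Applying the same two functions to the full gene sequence would give a value to compare against the reported 69%; the fragment shown here is not expected to match it exactly.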