Gene Avin_49810 details


Gene Information

Locus tag: Avin_49810
Symbol:
ID: 7763833
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 5046214
End bp: 5049153
Gene Length: 2940 bp
Protein Length: 979 aa
Translation table: 11
GC content: 58%
IMG OID: 643807813
Product: hypothetical protein
Protein accession: YP_002802047
Protein GI: 226946974
COG category: [S] Function unknown
COG ID: [COG4913] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
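
The gene and protein lengths in this record follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the CDS span includes the stop codon (both assumptions are consistent with the numbers listed, but the record does not state them). The function names are illustrative only.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
START_BP = 5046214
END_BP = 5049153


def gene_length(start: int, end: int) -> int:
    """Length in bp of an inclusive coordinate range."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 2940 979
```

Both values match the Gene Length and Protein Length fields of the record.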


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGGGCGCTG TGCTCTGGTT CGAGGGCACA AGCTCCTCGG CGTCGGATCT GAAAAAACTT 
TGGCTGTTAA GCGAGTCCCC CGAGCAGACT CTGGAGCATT GGCTCAACCA GCATCACGCA
GGAGGCATGC GGGCTCTGCG TCAGATGGAG AAGGATGGGA CGGGTATCTG GCCTTATCCC
AGCAAGAAAG CCTTCTTGGC CAGGCTGAGG GATTACTTCG AGGTCGGTGA AAACGCATTC
ACCTTGCTCA ACCGCGCTGC CGGCCTGAAG CAGCTCAATA GTATTGATGA GATCTTTCGA
GAGTTGGTGC TGGATGACCG CTCTGCCTTC GAGCGGGCTG CAGAGGTCGC CAGCAGCTTC
GATGATCTGA CAGACATTCA CCGAGAGCTG GAAACCGCTA GAAAGCAGCA GCGCTCATTG
CAACCGGTCG CCGATAGCTG GGAGCGTTAT AAGGCCTTGC AAGAACAGTT GCAGGATAAA
CAGGCTCTCG AAGGCATCCT CCCGGTCTGG TTCGCCGAGC AGGGGTATCG CCTGTGGTTG
GCTGAAACCA GGAGGCTCGA GAAGGAACAC AAACAGGCCG AACTGGATCA GGAGCAATGC
AGGAGTCAGC TTGAGGTCCA GAAAGGTGTG GTCGATCAAC ATCGTCAGCG CTACCTGCGA
GTAGGTGGCG CAGGGATAGA TCAATTACGC GAGCGCATCG CTGATTGGGT CAGGGAGTGC
GATAAGCGCA GCCTGAAAGC CGAGCAATAC CGACGCTTGG CCAAAGGGCT TGGACTAGCG
GATGAGTTAT CGGCCGCTGC TCTCAAGGAG AATCAGCAGC AGATCGCGGC ACGCCTGGAA
ATACTGGCCC AACAAATTAC TGACGCGCGC CAAAAGGCGT TCGACGCTGG TCTCGTCCAG
CAGGGGCTCA ATGGTCGCCT ACAGACCTTA CAGCAGGAAC GTGCTGAGGT TGAGAGACGG
CCAGGCTCGA ACCTGCCCGG GCAATTTCAT GCGTTTCGAA GCGATCTGGC GCAGGCGCTT
GACGTCGATG AGTCGGCCCT GCCTTTCGTT GCCGAGCTGG TGCAGGTCAA GCCCGAGGAA
CTGGCCTGGC GGGGCGCCAT TGAGCGGGCA GTCGGTAGCC ACCGACTACG TATCCTGGTG
CCGCAGGGCT CATCCCAGGC CGCACTGCGT TGGGTGAACC AACGGCACAA CCGCTTGCAT
GTGCGCTTGT TGGAGGTAAA GGAGCCGTCT TCGCGCCCCG TGTTCTTCGA CGATGGTTTT
ACCCGCAAGC TGACTTTCAA GGAGCATCCG TACCGAGAAA CCGTAAAGGC ACTACTGGCG
GACAATGACC GCCACTGCGT CGAAAGCACG GAGCAATTGC GTCATACCCC TCACTCCATG
ACGGCCCAGG GGCTGATGTC GGGCAAAGAA CGCTTCTTCG ACAAGCAGGA CCAGAAGCGC
CTGGACGAAG ACTGGCTGAC AGGCTTCGAC AACCGCGACC GTCTGGCCTT CTTGGCTGAG
CAGATTCGCG AGGTCAATGA ACAGCTCGCG CCTGCCAAGC TGGCCTTGGA TGCTGCCCAG
GACGATGCCC GTCAGTTGGA GACTCAAGCT TCGCTGCTGA AACGCGTGGA AGAACTGCAG
TTCGAGGATA TTGATCTGCC GGGAGCCCAG AGTCAGCTGG AGTCGCTGCG TACGCAACTG
ACCACCCTGA CGCGCCCCGA TTCTGACCTT GCCATGATCA AGTCCGAACT GGATAAAGCC
GAGGGCCTGC AGGACTCTCT GGATCAACAG CTCCGCCAAC TGATCGAGCA GTGCGTTCAG
CTAAAAACTC AGTTTGATCA GGCCGCATCC GCTACCCGTA AGGCATATAA CGGCGCGGAG
AAAGGGCTGA ACGATACGCA GCGTGAACTG GCGAAAGAGT ACTTCCCTAC TCTGGCCCCA
GAAGACCTAG GCGACATTAT CGAGTTGGAG CGCAAGCATA CGCGGGAGCT TCAACAGCAG
CTCAAATCGC TCAGCGAAAA ACTGAGTTAT CAGCAAGCAG AGCTTGCTAA ACGGATGTCC
GACGCCCTGA AAGTAGACAC CGGTGCGCTT TCAGAGGTCG GTCGGGAGCT GGCGGACATA
CCCAAGTATC TGGAGCGTCT GCGTGTGCTG ACCGAGGAGG CCCTACCCGA GAAACTCAAG
CGCTTCCTGG AGTATCTCAA TCGCTCCTCA GATGATGGGG TCACCCAGTT GCTCAGCTAT
ATCGATCATG AGGTCTCGAT GATCGAGGAA CGCCTAGATG ACCTGAACAG TACAATGCAG
CGTGTCGACT TCCAGCCAGG GCGCTATCTA CGTTTGGTTG CAGGCAAGGT CATCCATGAA
AGTCTACGCA CGCTGCAACG CGCTCAGCGC CAATTGAATT CGGCGCGCTT CATCGATGAT
GAAGGTGAGA GCCACTACAA GGCATTGCAG GAACTCGTAG GGCTACTCAA AGACGCTTGC
GAACACAGCA GGACTCAAGG GGCCAAGGCG CTTCTGGATC CGCGTTTCCG CCTGGAGTTC
GCGGTCTCGG TGATCGACCG CGAGAGTAAA AATGTCATCG AGACCCGAAC AGGCTCTCAG
GGCGGTAGCG GTGGTGAGAA GGAGATCATC GCCTCCTATG TGCTGACCGC TTCCCTCAGC
TATGCGCTCT GTCCCGACGG CAGCAGCCGG CCGTTGTTCG GCACCATCGT TCTCGACGAG
GCGTTCTCGC GCAGCTCCCA TGCGGTTGCC GGTCGGATCA TCGCGGCGCT GAGGGAATTC
GGTCTGCATG CTGTCTTCAT CACGCCCAAC AAGGAGATGC GCCTGTTGCG CCACCATACG
CGTTCGGCAG TCGTCGTTCA TCGGCGGGGC GTGGAATCCA GTTTGGTTTC CCTGAGTTGG
GAAGCTTTGG ATGAGCATCA TCAACAGCGC ATCAAGGCGA TGCATGAAGT CGCCCACTGA
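
The GC content field of this record can be recomputed from the gene sequence. The sketch below is one way to do that, assuming the sequence is pasted as plain text in the 10-base groups shown here; `gc_percent` is a hypothetical helper name, not part of any database tool.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First line of the gene sequence, used only as sample input.
first_line = "TTGGGCGCTG TGCTCTGGTT CGAGGGCACA AGCTCCTCGG CGTCGGATCT GAAAAAACTT"
print(round(gc_percent(first_line), 1))
```

Passing the full 2940 bp sequence instead of the first line gives the value to compare against the GC content field.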
 
Protein sequence
MGAVLWFEGT SSSASDLKKL WLLSESPEQT LEHWLNQHHA GGMRALRQME KDGTGIWPYP 
SKKAFLARLR DYFEVGENAF TLLNRAAGLK QLNSIDEIFR ELVLDDRSAF ERAAEVASSF
DDLTDIHREL ETARKQQRSL QPVADSWERY KALQEQLQDK QALEGILPVW FAEQGYRLWL
AETRRLEKEH KQAELDQEQC RSQLEVQKGV VDQHRQRYLR VGGAGIDQLR ERIADWVREC
DKRSLKAEQY RRLAKGLGLA DELSAAALKE NQQQIAARLE ILAQQITDAR QKAFDAGLVQ
QGLNGRLQTL QQERAEVERR PGSNLPGQFH AFRSDLAQAL DVDESALPFV AELVQVKPEE
LAWRGAIERA VGSHRLRILV PQGSSQAALR WVNQRHNRLH VRLLEVKEPS SRPVFFDDGF
TRKLTFKEHP YRETVKALLA DNDRHCVEST EQLRHTPHSM TAQGLMSGKE RFFDKQDQKR
LDEDWLTGFD NRDRLAFLAE QIREVNEQLA PAKLALDAAQ DDARQLETQA SLLKRVEELQ
FEDIDLPGAQ SQLESLRTQL TTLTRPDSDL AMIKSELDKA EGLQDSLDQQ LRQLIEQCVQ
LKTQFDQAAS ATRKAYNGAE KGLNDTQREL AKEYFPTLAP EDLGDIIELE RKHTRELQQQ
LKSLSEKLSY QQAELAKRMS DALKVDTGAL SEVGRELADI PKYLERLRVL TEEALPEKLK
RFLEYLNRSS DDGVTQLLSY IDHEVSMIEE RLDDLNSTMQ RVDFQPGRYL RLVAGKVIHE
SLRTLQRAQR QLNSARFIDD EGESHYKALQ ELVGLLKDAC EHSRTQGAKA LLDPRFRLEF
AVSVIDRESK NVIETRTGSQ GGSGGEKEII ASYVLTASLS YALCPDGSSR PLFGTIVLDE
AFSRSSHAVA GRIIAALREF GLHAVFITPN KEMRLLRHHT RSAVVVHRRG VESSLVSLSW
EALDEHHQQR IKAMHEVAH
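
The protein sequence can be checked against the gene sequence by translating it with translation table 11, as listed in the record. The following is a minimal sketch rather than a reference implementation: it builds the codon table from the standard NCBI ordering (bases in the order T, C, A, G), treats the table 11 alternative start codons such as TTG as methionine in the first position, and stops at the first stop codon. The function and constant names are illustrative.

```python
BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... GGG (NCBI ordering).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Start codons permitted by translation table 11 (bacterial, archaeal, plastid).
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(seq: str) -> str:
    """Translate a CDS; the start codon is read as M, translation ends at a stop."""
    dna = "".join(seq.split()).upper()
    if not dna or len(dna) % 3 != 0:
        raise ValueError("sequence length must be a non-zero multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError(f"{codons[0]} is not a table 11 start codon")
    protein = ["M"]
    for codon in codons[1:]:
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
first_30 = "TTGGGCGCTG TGCTCTGGTT CGAGGGCACA"
print(translate_cds(first_30))  # MGAVLWFEGT
```

The output matches the first ten residues of the protein sequence, which starts with M even though the gene begins with TTG.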