Gene Avin_20870 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_20870 
Symbol 
ID7761012 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp2079971 
End bp2081590 
Gene Length1620 bp 
Protein Length539 aa 
Translation table11 
GC content68% 
IMG OID643804982 
ProductTetratricopeptide repeat (TPR) protein 
Protein accessionYP_002799263 
Protein GI226944190 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCCTGG GCGCGCTCCT GGCACTGGGC ATGGCGTCTG TAACTTTCGC CCAGACCGGG 
CGTTCCGCCG TTCCGCTATA CGACAACCTC GGCGATCATC ACTACGCCAT CACGACCGCC
TCGCCACTGG CACAGCGCTA TTTCGACCAG GGCTTGCGGC TCTATTACGC ATTCAACCAT
CAGGAGGCGA TCCGCGTCTT CGAGGAAGCC GTCCGGCTCG ATCCGGACTG CGCCATGTGC
TACTGGGGGA TCGCCCTGGC GCAGGGCCCC AACATCAACG CCCCGATGGA CGCCAGCGCC
GGGGCCGCGG CACATGCCGC GACCCGCAAA GCTCTCGAAC GCAAGGCGAG CCCGAAGGAA
CAGGCCCTTA TCCGTGCCCT CGCGGCACGC TACGCCTCGC CTCCTCCCGA CGACCGGGCC
GCCCTGGACG AAGCCTATGC CCGCGCGATG CGCGAAGTCG TTCGCCAGTA CCCGGAAGAT
CGGGAGGCGG CGACCCTTTT CGCCGAATCG CTCATGGACC TGAACCCGTG GCAGTACTGG
AGCCATGACG GCCAGCCGCG GCCGAACACG CCCGAGCTGC TCACCCAGTT GGAGCGGGTG
ATCGCGGCGA ATCCCGACCA CCCGGGGGCG TGCCACTTCT TCATCCATGC CGTCGAGGCC
GCCCAGCCGG AACGCGCGGT GCCATGCGCG GAACGGCTCG CCGGCCTCAT GCCCGGCGCC
GGCCATCTGG TGCACATGCC CGGACACATC TACGTCCGCG TCGGACGCTA CGAAGACGCC
ATCGAGGCCA ACGAACACGC GGTGCATGCC GACGAAACCT ACATCCGCGA CCAGAATCCG
ACATTCGGCA TCTACGTCGC CGGCTATTAC CCGCACAACT ACGATTTCCT GGCCTTCGCC
GCGAGCATGA TCGGCCGTAG CGGGCAGGCG CTCGGCGCCA CGCGGAAGAT GGCCGAACTG
GTGCCGCAAG CGATGTTGCG GGAGCCCGGC ATGACCTTCC TGCAGCACCA CCAGACCCGC
CGATTACAGA TGCTGGTGCG CTTCGATCGC TGGGACGAGA TTCTCCAAAC CGAGGCCCCG
CCGCCGGATC TTCCCCATGC CAGCGCCCTC TGGCACTACG CCCGGGGCCG GGCGCTGGCG
GCGCGCGGCG ACGTACCGGG AGCCGAGGCG GAGCTGGCGC GACTGCGCGC CACGGCCGGG
AGTCCGCAGA CGGACGCGCT GCGTCTGGAG TTCAATACCT CGGGCGCCAT ACTGAAAATC
GCCTCGCAAG TATTGGCCGG TCACATCGCC GCCGGGAAGG CGGATTTCCC GGGCGCGATC
GGCCACCTGC GCGAAGCGGC CCGTCTGGAG GACGGCCTGG TCTACGGCGA GCCGCCGGAA
TGGACGGTGC CGGTACGCCA GGACCTCGGC CGGGTACTGC TCGAGGCGGG ACGGAACGAG
GAGGCCGAGC AGGCCTTTCG CGAAGACCTG CGACGTTTCC CGGAAAATGG CTGGTCGCTG
CACGGATTGG CCCGGACGCT GGATGCGCAG AACCGCGGCG AGGAAGCGGA TGCCGTCATG
GAACGCTTCC GCAAGGTCTG GGCAGGTGCC GACATGCAAT TGGCGGAAAC CGCACGCTGA
 
Protein sequence
MALGALLALG MASVTFAQTG RSAVPLYDNL GDHHYAITTA SPLAQRYFDQ GLRLYYAFNH 
QEAIRVFEEA VRLDPDCAMC YWGIALAQGP NINAPMDASA GAAAHAATRK ALERKASPKE
QALIRALAAR YASPPPDDRA ALDEAYARAM REVVRQYPED REAATLFAES LMDLNPWQYW
SHDGQPRPNT PELLTQLERV IAANPDHPGA CHFFIHAVEA AQPERAVPCA ERLAGLMPGA
GHLVHMPGHI YVRVGRYEDA IEANEHAVHA DETYIRDQNP TFGIYVAGYY PHNYDFLAFA
ASMIGRSGQA LGATRKMAEL VPQAMLREPG MTFLQHHQTR RLQMLVRFDR WDEILQTEAP
PPDLPHASAL WHYARGRALA ARGDVPGAEA ELARLRATAG SPQTDALRLE FNTSGAILKI
ASQVLAGHIA AGKADFPGAI GHLREAARLE DGLVYGEPPE WTVPVRQDLG RVLLEAGRNE
EAEQAFREDL RRFPENGWSL HGLARTLDAQ NRGEEADAVM ERFRKVWAGA DMQLAETAR