Gene Avin_26580 details

Gene Information

Locus tag: Avin_26580
Symbol:
ID: 7761566
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 2719249
End bp: 2721867
Gene Length: 2619 bp
Protein Length: 872 aa
Translation table: 11
GC content: 71%
IMG OID: 643805536
Product: Pentapeptide repeat protein
Protein accession: YP_002799809
Protein GI: 226944736
COG category: [S] Function unknown
COG ID: [COG1357] Uncharacterized low-complexity proteins
TIGRFAM ID:
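
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the final codon of the CDS is a stop codon; both are assumptions, as the record does not state them. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp):
    """Residue count, assuming the last codon of the CDS is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length = gene_length_bp(2719249, 2721867)
print(length)                     # 2619
print(protein_length_aa(length))  # 872
```

Under those assumptions, the computed values match the Gene Length (2619 bp) and Protein Length (872 aa) fields of the record.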


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAAATCA TCAAACCCCT GAGGCTCGGC GTGCTGCACC GCACCTATCA CTGGCGGCAC 
GGCCATCGCC TCGCCGTGAC GGCGATCGCC CTGGCGACCC TGGAGGAGTC TCCGGTGCTG
CTACCCGAGC AGGAACTCTG GGCGTTGCTC GACGAGGCGC TGGACGAGAA CGAGCAGATC
GACCTGCTGA TGCCCAAGCC ATGTCCGGAA TTCCTGGTCA ACGGTCACGC CTACAACGTC
CACGGGACGG ACCGCCGGCG CTGCCGGGTG CAGGCGCGGC TGGACGACCG CTGCAAGGCC
CTGGTGGTCC ATGGCGACCG GTACTGGGCG GAAGGCGAGG CCGGCGAGCC GGCTGACTTC
GAGGCCATGC CGCTCGGCCG GCGACGGGCC TATGGCGGCC CCGGTTACGA GGCCAATCCC
ATCGGCATCG GCCATGTCCC GGAGTCGGTC GACGGGGGCG AGCGCTGGCC CTTGCCGAAC
GTCGAGCACG ACGGACAGCC GCCGTTCGCC CCCGGCCAGC CGCCCGGGAC GCCGGCGGGT
TTCGGCATGC GCGACATCGA TGCGCCGGCG CACCGGGCGA AGCTGGGCCA GTACGACGAA
GAGACGCTGG AGCGGGACGG TCCGGGGCTG GCGGAGTCGT TCGACTGGCG CTTCTTCAAT
CTGGCGCCGG ACGACCAGCA ATGGCCGGAT CGCGACCGCC TGGCGGGCGG CCTGGAGTAC
GAATTCCTCA ACCTGCACCC CGAACTGGCG CGCCTCGCCG GGCACCTGCC GGACGGATTG
GCGCGCTGCT TCGTCATGCG CCAGCCGGCC GAAGGCGAAT CCGCCCTGGA AGAGATTCCG
CTGCGTCTCA CCACGGCCTG GTTCTTTCCC GACCGGCGGC GCGTGGCGCT CATCCACCAT
GGCGACCTGG CCGTCGACGA CGAGCAGGCT TCCGACATCC AGTACCTGAT GCCGGCCTTC
GAGGCCGGCG CCGCGCCGCG TGGCCTGGAG CACTACGCCG AGGCGCTGGC GCGGCGGCTG
GACGAGGAGG AGGGCGACCT GTACGCCTTC GACGAGGCGG CGCTGATCCA CGAACCCTTC
ATCGGCGCCG GCTTCGATAC CGAGCCGCTG GACCAGGGGC CGTCCGATCC GCTGATGGAC
AACCTGTTGC GCCGGCAGGA AGAGGCAATG CGCAACGAAC GCGAGCGGAT GAACTCGCTC
GGCCTCGATC CGCAGCGCCT GGCGGGCCTC GCGCCCGCCG GGGACGACGA GGAACTGCGC
CTGCAGCGGC TGGCCGACCT GCCGCGGGTC GGGCGCAATA TCCGGCGCAA GGAGGCGGAA
CTCGAAGCCC AGGCGGAACG CGAGCGGGCC GCCGCCCTGG AGCGCCTGCG CGACCAGGAG
CCGACGACGG CGAACCGGGA ACTGCTGGCG CAACTGGAAA ACCCGGACAG GGAACTTCCG
CCTTTCGATT TCGCCGCCCG TTCCGCGCAG TTGCGGGAGG TCTACCGGAT GGACTTTTCC
CCGGCGTCCG CCGACTTGCC GTCCCGGGAG GAAAGCGAGC GGCGCCTGCG CCAGCAGTAC
CGGGACTCGG TGCATTGCTG CGGGGCCGCC CCGGCCCTGC AGGGCGCGGC GGCCGAGGCC
CTGCGCCGGA AGGTGGCGCA AGCCTATGCG CGGGATCGCG ACCTGGCGGG TATGGATCTG
ACCGGCGCCG ATCTGTCCGG CATGGACCTG TCCGGCGCGC GGCTGACCGG CGTCCTGCTG
GAAAGCGCCA ACCTGGCGGA TGCGCGGCTG GACGGCGCCG ATCTGCGCGA GGCGATCCTC
GCCAGGGCCC GCCTGGACGG CGCCTCGTTG CGCGGCGCCG ATTGCCGGGG CGCCAACCTG
TCCCGCGCGC AGGCCCGCAA CGCCTGTTTC AGTGGCGCGA CCTTCGGCGA CGGGCAATGC
TGGGAAGCAC GCTTCGAGAC CTGCGATTTC AGCCACGCCC GGTTCGGCGG GATACTGTGG
CAGGGGTGCG AACTGGACGC CTGCCGTTTC GACGGGGCGG TGCTGGAAGA CTTGTCCCTG
CATGCCTGCC GGCTGAGCCG GCCTTCCTTC GTCGGGGCGA GCCTGAGTGC CGTCACCTGG
GTCGAGTCGA GCCTGGAGGC GGCCGATTTC GAGCGCGCGG AGCTGGACGA CTGCAGTCTG
GTGGAGACGC GCTCGCCCGA TGCCCGTTTC GTCGGGGCCG CCCTGAGTGC GTGCTACATG
GTGCTCGGCA GCTCCCTCGA GGCGGCGGAT TTCGGCGGCG CGCGGCTGGC CGAAAGCAAC
CTGCGCGGGG TGGATTTGTC CGGTGCGCGC TTCTGCGGGA CGCGCCTGGC CGATTGCGAT
CTGTCGGAGG CCCGCCTGGC GGGCGCGGAT CTGCGCCGGG CGACGGCCAG CGGCTGCCTG
TTCAGCGGCG CCGACCTGCG TACCGCCCGG CTCGGAGACG CCCATCTGAT GCAGTGTCTG
CTGCGCCGCG CGGATCTGCG CGGCGCCGAT CTGCGCGGCG CTTCGCTGTT CGGCAGCGAT
CTGGCCGAGG TGCATCTGGA CGAGGACAGC CTGCTGGACG AGACGGATTT CGGCCGGGTC
GCTTTCCACC CGCGCCGCCG CTCGGAGGCC GCGTCGTGA
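
The gene sequence above is printed in groups of 10 bases, 60 per line, so the spacing has to be removed before any computation. The following is a minimal sketch of how a GC percentage like the one in the GC content field could be computed; `clean_sequence` and `gc_percent` are hypothetical helper names, and the example runs only on the first line of the sequence, not on the whole gene.

```python
def clean_sequence(block):
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGGAAATCA TCAAACCCCT GAGGCTCGGC GTGCTGCACC GCACCTATCA CTGGCGGCAC"
seq = clean_sequence(first_line)
print(len(seq))         # 60
print(gc_percent(seq))  # GC percentage of this first line only
```

Passing the full 2619-base sequence through the same two functions would give a whole-gene figure to compare with the reported GC content.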
 
Protein sequence
MEIIKPLRLG VLHRTYHWRH GHRLAVTAIA LATLEESPVL LPEQELWALL DEALDENEQI 
DLLMPKPCPE FLVNGHAYNV HGTDRRRCRV QARLDDRCKA LVVHGDRYWA EGEAGEPADF
EAMPLGRRRA YGGPGYEANP IGIGHVPESV DGGERWPLPN VEHDGQPPFA PGQPPGTPAG
FGMRDIDAPA HRAKLGQYDE ETLERDGPGL AESFDWRFFN LAPDDQQWPD RDRLAGGLEY
EFLNLHPELA RLAGHLPDGL ARCFVMRQPA EGESALEEIP LRLTTAWFFP DRRRVALIHH
GDLAVDDEQA SDIQYLMPAF EAGAAPRGLE HYAEALARRL DEEEGDLYAF DEAALIHEPF
IGAGFDTEPL DQGPSDPLMD NLLRRQEEAM RNERERMNSL GLDPQRLAGL APAGDDEELR
LQRLADLPRV GRNIRRKEAE LEAQAERERA AALERLRDQE PTTANRELLA QLENPDRELP
PFDFAARSAQ LREVYRMDFS PASADLPSRE ESERRLRQQY RDSVHCCGAA PALQGAAAEA
LRRKVAQAYA RDRDLAGMDL TGADLSGMDL SGARLTGVLL ESANLADARL DGADLREAIL
ARARLDGASL RGADCRGANL SRAQARNACF SGATFGDGQC WEARFETCDF SHARFGGILW
QGCELDACRF DGAVLEDLSL HACRLSRPSF VGASLSAVTW VESSLEAADF ERAELDDCSL
VETRSPDARF VGAALSACYM VLGSSLEAAD FGGARLAESN LRGVDLSGAR FCGTRLADCD
LSEARLAGAD LRRATASGCL FSGADLRTAR LGDAHLMQCL LRRADLRGAD LRGASLFGSD
LAEVHLDEDS LLDETDFGRV AFHPRRRSEA AS
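
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The sketch below builds the codon-to-amino-acid mapping by hand rather than relying on any library; table 11 shares its amino acid assignments with the standard code and differs only in the set of permitted start codons, which does not matter here because this gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative, not part of any tool referenced by the record.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence in the record.
print(translate("ATGGAAATCA TCAAACCCCT GAGGCTCGGC"))  # MEIIKPLRLG
print(translate("GCGTCGTGA"))                         # AS
```

The two outputs match the first ten and last two residues of the protein sequence listed in the record.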