Gene Avin_06990 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_06990 
Symbol 
ID7759652 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp664297 
End bp665346 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content68% 
IMG OID643803620 
Producthypothetical protein 
Protein accessionYP_002797924 
Protein GI226942851 
COG category[S] Function unknown 
COG ID[COG5345] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.481755 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGGACT GGAAAGAAAG CGCAAGCCGT GCGTACGGTG GGGCCGACAG CGCCCTGCGG 
CGCATCGTCG GCAACCACTG GCTGGGGCGC GCCCTGGCGC TGTTGCTCGC CCTCTACCTG
CTGGTGACCG GATTGCTGGG CTGGTTCTGG AGCCTGGAGC CGGATGCCTT CCCGGTGCAG
GAGAACGCCC AGGCCTCGGC CGAGCAGGCG CAGCGCAAGT TCGTCAAGGG CTACACCACC
GTCGAGACCC TGCGGACCGT GGCCGGTACC CTGCTCGACA AGCCGGGCGG CTACCTGACC
AACGACCTGT CCCCGCCCGG CATCTGGCTG GACAACATGC CCAGTTGGGA ATTCGGCGTG
CTGACCCAGG TGCGCGACCT GGCGCGCTCG CTGCGCAAGG AAATGGCCCG TTCGCAGTCG
CAGTCCACCG AAGACCCGGA TCTGGCCAAG GCCGAGCCGC GCTTCAACTT CGACAACCGT
AGCTGGGCGC TGCCGGCCTC CGAAACCGAA TACCGCGCCG GTCTCAAGCT GCTCGACAGC
TATCTGGCGC GCCTGGCCGA CCCGGTCAAG CCCAGCGCGC AGTTCTTCGC CCGCGCCGAC
AACCTGAATG GCTGGCTGGG CGACGTCGCC ACCCGTCTCG GCTCGCTCTC CCAGCGGCTC
TCGGCGAGCA TCGGCCAGGA GCGCCTGGAT GCCGACCTGG TGCCCGACGA GGAGACCGGC
CAGGTACAGC AGGGCGAAGT CGTCAAGACG CCCTGGCTGC AGATCGACAA CGTGTTCTAC
GAGGCGCGCG GCCAGGCTTG GGCGCTGGCG CATTTCCTGC GCGCCATCGA GGTGGACTTC
GGCGACGTGC TGGCGCGCAA GAACGCCACC GTCAGCCTCC AGCAGATCAT CCGCGAGCTG
GAAGCGGCGC AGGAGCCGCT GTGGAGCCCG ATGGTGCTGA ACGGCGGCGG CTACGGCATG
CTGGCCAACC ACTCGCTGGT GATGGCTAAC TTCATCTCCC GGGCCAATGC CGCGCTGATC
GACCTGCGCG CGCTGCTTTC CCAGGGCTGA
 
Protein sequence
MLDWKESASR AYGGADSALR RIVGNHWLGR ALALLLALYL LVTGLLGWFW SLEPDAFPVQ 
ENAQASAEQA QRKFVKGYTT VETLRTVAGT LLDKPGGYLT NDLSPPGIWL DNMPSWEFGV
LTQVRDLARS LRKEMARSQS QSTEDPDLAK AEPRFNFDNR SWALPASETE YRAGLKLLDS
YLARLADPVK PSAQFFARAD NLNGWLGDVA TRLGSLSQRL SASIGQERLD ADLVPDEETG
QVQQGEVVKT PWLQIDNVFY EARGQAWALA HFLRAIEVDF GDVLARKNAT VSLQQIIREL
EAAQEPLWSP MVLNGGGYGM LANHSLVMAN FISRANAALI DLRALLSQG