Gene Avin_39120 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_39120 
Symbol 
ID7762801 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp3962913 
End bp3964157 
Gene Length1245 bp 
Protein Length414 aa 
Translation table11 
GC content75% 
IMG OID643806775 
Producthypothetical protein 
Protein accessionYP_002801027 
Protein GI226945954 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00513866 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGAAAT CCAGGCTCTT GCGTCTTTCC CTGCCGTTGC TCGTCGGCGC CGTGCTGGCC 
GCCTGCGCCG GGCGCCCCGA GCCGCCCGGT CCGCCGCCCG GTGAAAACCT CGAGCGTCGA
CTGGAGGGCG CCTACCGGCC GGCCGCGCGG TCGCCGGTGC GCGGCTGGGA TGACGAATGG
CAGGTGGCCG GACAGGCGGT GGAGGTCAGT TGGCTGGCGC CGGAACAGGT CCAGGGGATG
CCGCTGATCC TCTATCTGCC GGGGCTCGGC GAGAGCAGCC GGGACGGTGT GCAATGGCGC
CGGGCCTGGG CCGAGGCCGG CTACGCGGTG CTTTCGGTAC AACCTCGGCA GTACGGACGG
GTGATCTACT CCAGCTCCGA GGCGCAGGTG GGGGTATTCC GCTCCCTGGC GCAGAAGAGC
TTCGCTGACC AGGCCCTGGC GGCGCGCATC GCCGTGCTCG ATCAGGTCCT GGCGGATCTG
CGCAGACGGG CGCAAGCCGG CGAGCCGAAC CTGGCCAGGG TCGACTGGCA GCGGCTGGCG
GTGGCCGGCT TCGATCTCGG CGCGCAGACC GCCGCCGCCC TGGCCGGCGA GCGCGCGGCC
GGAGCATCGG CGCCGGCCGG CTGGCAGCCG AGGGCGGCGA TCCTGCTCAG TCCCTATGTC
GCCGAGGACG GCGGGGGCGA CCGCTTTGGC CGCATCGGTA CGCCGCTGCT GGCGGTCACC
GGGCCGCACG ACGAGGACCC TTTCGGCTGG GTCGACCCGC CGAGCCGCCG CCAGCGGCTC
TGGGAGGGGG TGAGGACCTC CGGCAGCTAT CAACTGATCG CCGCCGAGGC CAGTCATCGG
CTGCTGAGTG GCTCGTTCGA GGACATGGCC GGAGCGGGCG GCGGGCGTCA GGGCGGACCT
TCGTCCGGCG GCCGGCCGGA GGGGGGCGCT GGTCGTGGCG GAAGCGGCGG GCCTGGCGGC
GGTGGGGGAC CCGGTGGGGG CGGCGGGCCC GGTGGCGGTC CTGCAGGCGG CGGACGCGCC
GGTGGTCCCG GCGGCAAGGG CGGCGGCCCC GGTGGCGGCA GTCGGATGGG CCGCGGCCAA
GGCATGGAGG AACGCATCGA TCCGCGCCAG ATGGCCAGCC TGCAGAGCCT CAGCCTGGCC
TTCCTCGACG CCCGGGTGCG CGATGCCGCG CCGGCGCGCC TGTGGCTGGA ACGCGACGCC
GTCCAGTGGC TGGAGGCGAC CGGCCGGCTC GAGCGGAAAC CCTAA
 
Protein sequence
MSKSRLLRLS LPLLVGAVLA ACAGRPEPPG PPPGENLERR LEGAYRPAAR SPVRGWDDEW 
QVAGQAVEVS WLAPEQVQGM PLILYLPGLG ESSRDGVQWR RAWAEAGYAV LSVQPRQYGR
VIYSSSEAQV GVFRSLAQKS FADQALAARI AVLDQVLADL RRRAQAGEPN LARVDWQRLA
VAGFDLGAQT AAALAGERAA GASAPAGWQP RAAILLSPYV AEDGGGDRFG RIGTPLLAVT
GPHDEDPFGW VDPPSRRQRL WEGVRTSGSY QLIAAEASHR LLSGSFEDMA GAGGGRQGGP
SSGGRPEGGA GRGGSGGPGG GGGPGGGGGP GGGPAGGGRA GGPGGKGGGP GGGSRMGRGQ
GMEERIDPRQ MASLQSLSLA FLDARVRDAA PARLWLERDA VQWLEATGRL ERKP