Gene Avin_38910 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_38910 
Symbol 
ID7762780 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp3936203 
End bp3938581 
Gene Length2379 bp 
Protein Length792 aa 
Translation table11 
GC content61% 
IMG OID643806754 
Productsurface antigen 
Protein accessionYP_002801006 
Protein GI226945933 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG4775] Outer membrane protein/protective antigen OMA87 
TIGRFAM ID[TIGR03303] outer membrane protein assembly complex, YaeT protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACGCC TGCTGCTGCC TGCGGTGGTC TTCGCCGCCA TGACGATCAC AGAGGTCCAC 
GCCGAGTCCT TCACCATCTC CGATATCCGT GTCAATGGTC TGCAGAGGGT TTCTGCCGGC
AGCGTATTCA GCGCCCTGCC GCTCAGCGTC GGCGATCAGG TGGACGACCA GCGCCTTGTG
GAGGGTACGC GCGCACTCTT CAAGACCGGT TTCTTTCAGG ATATCCAGCT TGGCCGCGAT
GGCAATGTCC TGGTCATCAC GGTCGTCGAG CGGCCTTCCA TCTCCAGCAT CGAACTGGAG
GGCAACAAGG CCATCAAGTC GGAAGACCTG CTGAATGGCC TGAAGCAGTC GGGACTCGCC
GAGGGCGAGA TCTTCCAGCG GGCCACCCTG GAAGGGGTGC GCAACGAGCT GCAGCGCCAG
TACGTAGCCC AAGGCCGTTA TTCGGCCTCG ATCGAGACCG AGGTCGTGCC GCAGCCGCGT
AACCGGGTGG CACTGAAGAT CAAGATCAAC GAAGGTTCGG TCGCCGCCAT CCAGCATATC
AACGTGGTGG GCAATACCGT CTTTTCGGAC GAAACCCTGC TCAGGCTGTT CGAACTGAAG
ACCAGCAACC TGCTGTCCTT CTTCCGCAAC GACGACAAGT ACGCTCGCGA GAAGCTGTCC
GGCGACCTGG AGCGTCTGCG TTCCTATTAT CTGGACCGCG GCTACATCAA CATGGATATC
GCCTCCACCC AGGTATCCAT CACTCCGGAC AAGCGCCATG TCTACATCAC CGTCAACATC
GACGAGGGCG AAAAGTACAG TATCCGCGAC GTCAAGCTCA CCGGCGATCT GAAGGTGCCG
CCCGAGGAGA TCGAGTCCCT GCTGCTGGTC AAGGAGGGAC AGGTGTTCTC CCGCAAGGTG
ATGACCAGTA CCTCCGAGTT GATAACCCGA CGCCTGGGCA ACGAAGGCTA TACCTTCGCC
AACGTCAATG CCGTGCCCGA GCCGCATGCC GAAGACAAGA CGGTTTCCGT GACCTTCGTG
GTCGACCCGG GCAAGCGGGC CTACGTCAAC CGCATCAACT TCCGCGGCAA CACCAAGACC
GAGGACGAAG TACTCCGTCG CGAGATGCGC CAGATGGAGG GCGGCTGGGC CTCGACCTAC
CTGATCGACC AGTCCAAGAC CCGTCTCGAG CGCCTGGGCT TCTTCAAGGA GGTCAGCGTT
CAGACGCCGC AGGTACCGGG CAGCGACGAT CAGGTCGACG TCAACTTCAC CGTCGAGGAG
CAGGCTTCCG GCTCCGTCAC CGCCAGCGTC GGCTTCGCCC AGAACGCCGG TCTGGTGCTG
GGTGGCTCGA TCAGCCAGAA CAACTTTCTC GGCAGCGGCA ACAAGGTCAC CATCGGCCTG
ACCCGCAGCG AGTACCAGAC CAACTACAAC TTCGGCTTCG TCGATCCCTA CTGGACCGAG
GATGGTATCA GCCTCGGCTA CAACGTGTTC TTCCGGACCA CCGACTACGA CGATCTGGAG
GTGGATTATT CCAGCTATTC GGTGGACAGT TTCGGCTCCG GCATCAACAT CGGCTATCCG
ATCAGCGAGA CCGCGCGTCT GAGCTTCGGC CTCAGTCTCC AGCAGGATAG CATCGATACC
GGAACCTACA CGGTCGACGA GATCTTCGAC TTCCTCGACG AGGAGGGCGA GGACTACCTG
AACTTCAAGG CTTCGGCGGG CTGGTCGGAG TCGACCTTGA ACCGGGGGGT GATGCCCACC
CGCGGCGCTT CGCAGAGCCT GACCTTCGAA ACCACGCTGC CCGGCAGCGA CCTGTCGTTC
TACAAGATCG ACTACAACGC CCAGATGTTC AAGGCGTTGA CGGACGATTA CACCCTGCGC
TTCCATACCA AGCTCGGCTA TGGCGACAGC TTCGGTTCCA CCTCGAAGAT GCCTTTCTAC
GAGCATTACT ATGCCGGCGG TTTTTCTTCC GTCCGGGGCT TCAAGGACAA CACCCTGGGC
GCGCGCAGTA CGCCGAGCCG GGGTGAGGCC GTGACCGGCA ACGAGGGGAC CGAGGAGGAT
TCGGATCAGG ACGAGCAGCC GTTCGGCGGC AACGTGCTGG TGGTCGGCGG TGTCGAGATG
ATGTTCCCGA TGCCGTTCGT CAAGGATCAG CGCTCGCTGC GTACATCCCT GTTCTGGGAT
GTGGGCAACG TATTCGATAC CAACTGCAGT TCCTCGCAGA AGGAACGCAA CGACGACAGT
TGCGATATCG ACTTCAGCAA CATGGCCAGC TCGGTCGGGC TGGGCGTGAC CTGGGTCAGC
GGCTTCGGGC CGCTGAGCTT CAGCCTGGCC GTGCCGGTCA TGAAGCCCAA CGACGCAGAG
ACCCAGGTAT TCCAGTTCTC AATGGGTCAG AGCTTCTAA
 
Protein sequence
MKRLLLPAVV FAAMTITEVH AESFTISDIR VNGLQRVSAG SVFSALPLSV GDQVDDQRLV 
EGTRALFKTG FFQDIQLGRD GNVLVITVVE RPSISSIELE GNKAIKSEDL LNGLKQSGLA
EGEIFQRATL EGVRNELQRQ YVAQGRYSAS IETEVVPQPR NRVALKIKIN EGSVAAIQHI
NVVGNTVFSD ETLLRLFELK TSNLLSFFRN DDKYAREKLS GDLERLRSYY LDRGYINMDI
ASTQVSITPD KRHVYITVNI DEGEKYSIRD VKLTGDLKVP PEEIESLLLV KEGQVFSRKV
MTSTSELITR RLGNEGYTFA NVNAVPEPHA EDKTVSVTFV VDPGKRAYVN RINFRGNTKT
EDEVLRREMR QMEGGWASTY LIDQSKTRLE RLGFFKEVSV QTPQVPGSDD QVDVNFTVEE
QASGSVTASV GFAQNAGLVL GGSISQNNFL GSGNKVTIGL TRSEYQTNYN FGFVDPYWTE
DGISLGYNVF FRTTDYDDLE VDYSSYSVDS FGSGINIGYP ISETARLSFG LSLQQDSIDT
GTYTVDEIFD FLDEEGEDYL NFKASAGWSE STLNRGVMPT RGASQSLTFE TTLPGSDLSF
YKIDYNAQMF KALTDDYTLR FHTKLGYGDS FGSTSKMPFY EHYYAGGFSS VRGFKDNTLG
ARSTPSRGEA VTGNEGTEED SDQDEQPFGG NVLVVGGVEM MFPMPFVKDQ RSLRTSLFWD
VGNVFDTNCS SSQKERNDDS CDIDFSNMAS SVGLGVTWVS GFGPLSFSLA VPVMKPNDAE
TQVFQFSMGQ SF