Gene Saro_1080 details

Gene Information

Locus tag: Saro_1080
Symbol:
ID: 3916376
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 1124402
End bp: 1125430
Gene Length: 1029 bp
Protein Length: 342 aa
Translation table: 11
GC content: 67%
IMG OID: 640443815
Product: Flp pilus assembly CpaB
Protein accession: YP_496359
Protein GI: 87199102
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3745] Flp pilus assembly protein CpaB
TIGRFAM ID: [TIGR03177] Flp pilus assembly protein CpaB


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACAAGA GGAAGCTGAT GTTGCTGGTG GGGGCGCTGA TCGTAGCGAT CGGCACGGCC 
TTCGCAGCAA GGAGCCTTTT CGCCGGCAAT TCGACGCCCC AGGCTGAAGC CGCGGCCAAG
GTGCCGACCG GCCCCAAGGT CCTCGTGGCG CAGCGCGCGC TTCCGGTCGG CACGATCATC
ACAGCCGATT CAATCAACTT CCAGGCCTGG CCCAAGGACA TGGTGCAGGA CGCCTACTTC
GTCGAAGGCG AGGCGGACAT GCAGAAGCTG CTCGGCACCG TCGTCCGCAA TCCGATCACG
GCGGGCGAAC CGGTGACCAA GGGCAATCTT GTCGCCCCCG GCGACCGCGG CTTCCTCGCT
GCTGCTCTCG GTGCCGGCAT GCGCGCCGTC ACCATCCCTG TTTCCGCACG CACCGGCGTT
GCCGGCTTCG TCTTCCCGGG CGATCACATC GATCTCGTGC TGACCCAGAC TGTCAAGGGC
ACGGGCGAAG GCATGGCGCT CAAGGCGTCG GAGACGATCC TCAAGAACCT GCGCGTCCTT
GCCACCGACC AGTCGACCGA ACAGGAACAG GTCGAAGGCA AGACCCGCGT CCGCACCTTC
AGCACCGTCA CACTCGAAGT GACGCCCAAG ATCGCCGAGA AGATTGCGGT CGCGCAGACC
ATCGGTACGA TCAGCCTCTC GCTGCGCTCG CTGGCCGACA ACTCGGCCGA GCTGGAGCAG
GCCATTGCCG CCGGCGACGT CAAGATCCCG GCAGGCGTGA CCAAGCAGCA GGAAGAGGCC
CTGCTCCAGC AGGCGATGAA CCGCCCGCTG GGCGGCGGCA AGGACAGCTT CGTCACCGGC
GGCGACGTCT CGCGCTTCCA GCGCAAGACC ATGCCCGTCA CGGCACCCGC AGCCGGCATG
GGCGCCCCGC AGATGGCAGC CGCCACCGGC CCGCAAATGG CCTCCAACGC TGCCCCCGTT
CGCCGCGGCC CGGTCGTCCG TGTGACCCGC GGCAAGGAAA CAGAATCCGT CTCGCTGGGA
GGGAACTGA
 
Protein sequence
MDKRKLMLLV GALIVAIGTA FAARSLFAGN STPQAEAAAK VPTGPKVLVA QRALPVGTII 
TADSINFQAW PKDMVQDAYF VEGEADMQKL LGTVVRNPIT AGEPVTKGNL VAPGDRGFLA
AALGAGMRAV TIPVSARTGV AGFVFPGDHI DLVLTQTVKG TGEGMALKAS ETILKNLRVL
ATDQSTEQEQ VEGKTRVRTF STVTLEVTPK IAEKIAVAQT IGTISLSLRS LADNSAELEQ
AIAAGDVKIP AGVTKQQEEA LLQQAMNRPL GGGKDSFVTG GDVSRFQRKT MPVTAPAAGM
GAPQMAAATG PQMASNAAPV RRGPVVRVTR GKETESVSLG GN
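
The gene and protein sequences above can be cross-checked against each other by translating the DNA. The following is a minimal sketch, not the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which start codons are permitted), and it strips the spaces used to group the sequence into blocks of ten. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order ("*" marks a stop codon).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First row of the gene sequence, copied from the record.
first_row = "ATGGACAAGA GGAAGCTGAT GTTGCTGGTG GGGGCGCTGA TCGTAGCGAT CGGCACGGCC"
print(translate(first_row))  # MDKRKLMLLVGALIVAIGTA
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after removing its spaces) would check the whole record, and `gc_percent` on the same input can be compared with the listed GC content.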