Gene Snas_3669 details


Gene Information

Locus tag: Snas_3669
Symbol:
ID: 8884868
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Stackebrandtia nassauensis DSM 44728
Kingdom: Bacteria
Replicon accession: NC_013947
Strand:
Start bp: 3900270
End bp: 3901565
Gene Length: 1296 bp
Protein Length: 431 aa
Translation table: 11
GC content: 65%
IMG OID:
Product: Citrate transporter
Protein accession: YP_003512420
Protein GI: 291301142
COG category:
COG ID:
TIGRFAM ID:
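
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end coordinates are 1-based and inclusive, and that the gene length includes the stop codon. The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_bp // 3 - 1


# Values taken from the record for Snas_3669.
length = gene_length_bp(3900270, 3901565)
print(length)                     # 1296
print(protein_length_aa(length))  # 431
```

Both results match the Gene Length and Protein Length fields reported in the record.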


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.955706
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.0571681
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGTCA TCGCCTGGGT CGCGGTGGCG GTCTTCGTCG GCGCCTATGC CCTGATCGCG 
ACCGAGAAGA TTCACCGCAT CGCCGCCGCG TTGGGCGGAG CGGCGATCAT GCTCGTCATC
GGCGCCACCA CACCCGAAGA CGCGTTCTTC AGCGAGAAGT CAGGCATCGA CTGGAACGTC
ATCTTCCTGC TGCTGGGGAT GATGTTGATC GTGTCGGTAC TGCGCCGCAC CGGGGTCTTC
GAGTACCTGG CGATCTGGGC GGTCAAACGC GCCGCAGGCC GCCCGTATCG GGTGATGGTG
CTGCTGGTCC TGGTCACCGC GCTGGCCTCG GCAATCCTGG ACAACGTCAC CACGATCCTG
CTCATCGCCC CGGTGACGTT CCTGGTGTGC GAGCGGCTGA AGGCACCGGT GGCGCCGTTT
CTGATCGCCG AGGTGTTGGC CTCCAACATC GGCGGAACCG CGACCCTGGT CGGGGATCCA
CCCAACATCA TCATCGCCGC TCGCGCCGAT CTGTCCTATA ACGACTTCCT CATCCACCTC
GCGCCGATTG TGCTCATCCT GCTGGTGGTG TTCATCGGAT TGTGCCGCGT CATGTTCCGT
TCCGCGTTCA CCTACGACCC CGACACCGCC GCCCAGCTGG CGCGGCTGCG CGAACGCGAC
GCCATCAAGG ACTCCCGACT GCTGGTCATC AGTTTGGTGA TGCTGGTCGT GGTGACGGCC
GCGTTCATGG TCCACACCGT CATCCACATC GAACCGTCGG TGGTGGCGAT GGTCGGTGGC
CTGGGCCTGT TGGCGTTGTC GCGTTTGAAC ACCGACGCGG TGCTCAAGGA CGTCGAGTGG
CACACGCTGG TGTTCTTCGC AGGTTTGTTC ATCCTGGTCG GTTCGCTGGT GTCCACCGGC
GTCATCGCGC ACGTCTCGCA GGCCGCCACC GAAGCCACCG GTGACCGGGT ACTGGGAGCG
TCCATGTTGC TGCTGTGGGG GTCGGCGTTC CTGTCGGCCA TCGTGGACAA CATCCCGTAC
GTGACGACGA TGAGCCCGGT CGTGGCCGAC ATGGCCGCGG CTCAACCCGG CGACAGCGGC
CAGGTGCTGT GGTGGTCGCT GGCCCTGGGC GCCGACCTCG GCGGTAACGC CACCGCCGTG
GGCGCTTCGG CCAACGTCGT CATGCTGGGC ATGGCCGAAC GCGCGGGCAA GAAGATCAGC
TTCTGGCAGT TCACCAAGTA CGGACTCGTC ACCACCGTCG TCACCATCGC CCTGGCCACG
CCCTACCTGT GGCTGCGCTA CTTCGTACTC ACCTGA
 
Protein sequence
MSVIAWVAVA VFVGAYALIA TEKIHRIAAA LGGAAIMLVI GATTPEDAFF SEKSGIDWNV 
IFLLLGMMLI VSVLRRTGVF EYLAIWAVKR AAGRPYRVMV LLVLVTALAS AILDNVTTIL
LIAPVTFLVC ERLKAPVAPF LIAEVLASNI GGTATLVGDP PNIIIAARAD LSYNDFLIHL
APIVLILLVV FIGLCRVMFR SAFTYDPDTA AQLARLRERD AIKDSRLLVI SLVMLVVVTA
AFMVHTVIHI EPSVVAMVGG LGLLALSRLN TDAVLKDVEW HTLVFFAGLF ILVGSLVSTG
VIAHVSQAAT EATGDRVLGA SMLLLWGSAF LSAIVDNIPY VTTMSPVVAD MAAAQPGDSG
QVLWWSLALG ADLGGNATAV GASANVVMLG MAERAGKKIS FWQFTKYGLV TTVVTIALAT
PYLWLRYFVL T
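
The protein sequence is the translation of the gene sequence under translation table 11. A minimal sketch of that relationship follows, assuming the sequences are pasted in as whitespace-separated 10-character blocks as shown above. The helpers `clean`, `translate`, and `gc_fraction` are hypothetical names introduced for illustration. The amino-acid assignments used are those of the standard code, which table 11 shares; table 11 differs only in its permitted start codons, which this sketch does not model.

```python
from itertools import product

# Codons in NCBI order (first base varies slowest, bases ordered T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence, copied from the record.
print(translate("ATGAGCGTCA TCGCCTGGGT CGCGGTGGCG"))  # MSVIAWVAVA
```

Applied to the full gene sequence, `translate` should reproduce the protein sequence listed above, and `gc_fraction` gives a value to compare against the reported GC content.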