Gene Francci3_1774 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1774 
Symbol 
ID3904004 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2110335 
End bp2111411 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content70% 
IMG OID637879112 
Productsulfotransferase domain-containing protein 
Protein accessionYP_480879 
Protein GI86740479 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.588183 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCGCCG CCGTTCCACC ACCTGGGTCG GCGCGCCCTG TCCTGATTAT TGGTACTGAG 
CGATCCGGGT CGAATCTTCT GCGGCTCATG CTCGACGCAC ACCCCGCCAT CGCGGTCCCG
CACCCGCCGC ACCTGATGAG GTACCTGGCC CCGCTGGCAG CGTCCTACGG AGACCTCGGT
GTTGCGGCCA ACCGCACACG GCTGGCCCGC GACGCGCTGC GGATCGTGCG TGCTCACCTG
CATCCTTGGC CGCATCCGGT AGATCTCGCA CGCGTGGTCC GCGAGTCCGA CGCCTCCACG
TTCGGCGTTG TAGCCGCGAT CTACGAACAG TACCGCGAGG CCGAGGACAA GCCGCGGTGG
GGGTGCAAGA GCACGTTCAT GGTCGACCAC ATCGACGAGG TGCTACGCCG CTATCCCGAC
GCCCGGTTCG TCTGGCTGGT CCGGGATCCC CTAGACGTCG CTGCCTCGGC CAAGCGTGCG
GTCTTCGGCC CAAGCATGCC CTATCGGATG GCCCGGCTAT GGCTACGCGA GCAGCGGTGC
GCGGACGCGG CGCTGGCGCG GCACGGGCCC GCGGTGGTAT ACCTGCTTCG CTACGAGGAC
TTGGTGACCG AGCCAGAGGG CGCGTTGAAC GAACTCTGCT CCTTTCTCGG CGAGCCCATG
CATGCCGGGA TGTTGCACCA TCATCTCACA TCGGGGGCGC GTCGGATCGG CGCGCTCGCC
GAGTCTTGGA GACGGGCCGC GCAGCCGGTC GGCGCCGACC GGATCGGCGC GCACCGCACC
GGCCTGACCG CCGCCGAGCG TAGGCAGGTG GCTGCGGTGG CTGCACCGCT GGCCCGGCGG
CTGGGCTACG ACCATGGCTC GGACGCCGAC GCCGCGCCGG AGGAGGTGGC GCCTTCGATG
GTCGCCATGG CGCTGCGCTC GGCTGGACTG CGCACCGTGA TCGAGGTGCG TTCGTTGTGC
CGGGACCGTA ACTACACCCG TAGGCTCCGG CGCGACGCGA CGGTGCGCTC GCTGCGGCTG
ACAGCGTGGG CGCGCACCCG GGTGCCAATG GAACTGCCCC AGTTGAGAAC CCGGTGA
 
Protein sequence
MSAAVPPPGS ARPVLIIGTE RSGSNLLRLM LDAHPAIAVP HPPHLMRYLA PLAASYGDLG 
VAANRTRLAR DALRIVRAHL HPWPHPVDLA RVVRESDAST FGVVAAIYEQ YREAEDKPRW
GCKSTFMVDH IDEVLRRYPD ARFVWLVRDP LDVAASAKRA VFGPSMPYRM ARLWLREQRC
ADAALARHGP AVVYLLRYED LVTEPEGALN ELCSFLGEPM HAGMLHHHLT SGARRIGALA
ESWRRAAQPV GADRIGAHRT GLTAAERRQV AAVAAPLARR LGYDHGSDAD AAPEEVAPSM
VAMALRSAGL RTVIEVRSLC RDRNYTRRLR RDATVRSLRL TAWARTRVPM ELPQLRTR