Gene Francci3_3569 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3569 
Symbol 
ID3904508 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4266653 
End bp4267690 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content70% 
IMG OID637880890 
Productbinding-protein-dependent transport systems inner membrane component 
Protein accessionYP_482650 
Protein GI86742250 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0600] ABC-type nitrate/sulfonate/bicarbonate transport system, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0917022 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCACTG ACGTCAGCGT CTCCAGCGAT AAGAGTGTCC TGCTCGGCGA GGCGGGTAAG 
TCCTTCGACG TCGACACGGC CCTCGCCGGC CTCGACGCGC TGGACATCCC GACCGCGCAA
CGGCGGCCGT TCGCGGTCCG CGTCTGGGCG GCCAGCTGGC CGAAGCTCGG CGCTCTGCTC
CTGTTCCTGC TGCTCTGGCA GATCGTCGTC TGGAGTGGCT GGAAGCCCAG CTATGTGTTG
CCCGGCCCGG GCGAGGCACT GGGCGAGTTC GTCGACCAGC TGGGCAGCGG ACACTTCTGG
GACGCCCTCG CCCGCACCCT GGTCCGGGCC CTGGAGGGAT ACGCTCTCGC CGTGCTCATC
GGCACCGTGG TCGGGATCGC GGTGTCCCGC TTCGGAGTCC TGCGTACCGC GGTGGGATCG
TTCATCACCG CGGTGCAGAC GATGCCCTCG ATCGTCTGGT TCCCGCTCGC CGTCCTGCTG
TTCAAGCTCA GCGAGTCCGC AATCATGTTC GTGGTGGTGC TCGGGGCGGC GCCCTCGGTG
GCCAACGGCG TCATCTACGG CGTGGACTAC GTGCCGCCTC TGCTGGTCCA GGTCGGCCGC
AGCATGGGCG CGCGTAGCCT GTCCCTCTAC CGGTACGTCG TGGTACCGGC GGCCCTGCCC
TCGGTGCTCG CGGGCCTGAA GCAGGGCTGG GCGTTCGCTT GGCGCAGCCT GATGGCCGGC
GAATTGCTGG TCATCGTTCC GGGGCATCCG TCCGTCGGAG CTGACCTGCA GAATGCCCGC
GAACTACTCG ACACGGTCGG GGTACTGGCC TCGATGATCA CGATTTTCGT GATCGGGGTG
CTCATCGACG CCGGGTTCAA CGCGGCGGAC CAGCGGATGC GGCAGCGCCG TGGGCTGGTT
GCGGAGGGTA CGACGGCGGT CCGGGCAGCT CGGCGTCGCC GGAGCGGCAG CGCCGACGGG
TCCGCCGCGG CCGCCACCAC GGCCACCTCC TCCGCCGGAG GCGGCACGGA CGCCCCCCGT
CCCGAGAGGG CTGGCTGA
 
Protein sequence
MATDVSVSSD KSVLLGEAGK SFDVDTALAG LDALDIPTAQ RRPFAVRVWA ASWPKLGALL 
LFLLLWQIVV WSGWKPSYVL PGPGEALGEF VDQLGSGHFW DALARTLVRA LEGYALAVLI
GTVVGIAVSR FGVLRTAVGS FITAVQTMPS IVWFPLAVLL FKLSESAIMF VVVLGAAPSV
ANGVIYGVDY VPPLLVQVGR SMGARSLSLY RYVVVPAALP SVLAGLKQGW AFAWRSLMAG
ELLVIVPGHP SVGADLQNAR ELLDTVGVLA SMITIFVIGV LIDAGFNAAD QRMRQRRGLV
AEGTTAVRAA RRRRSGSADG SAAAATTATS SAGGGTDAPR PERAG