Gene Francci3_4239 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_4239 
Symbol 
ID3907205 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp5057602 
End bp5058603 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content69% 
IMG OID637881565 
Productperiplasmic binding protein 
Protein accessionYP_483314 
Protein GI86742914 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0614] ABC-type Fe3+-hydroxamate transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.364549 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAAAC TCGGCGCCGG CGTCGCGGTC GCTGCCCTGC TCGTTGTTAC CGCCGGTCTC 
ACCGGCTGCG GGTCCGGTTC GACCACCGGC GCGGGCAGTG CGAGCGCGGC CGATGCCCGG
TCGGCCCCGG CCGGTACCCG CTATCCCGTT GAGGTGGCGA ACTGCGGCCG GACCCTGCAC
TTCGACGAGG CCCCGGCGCG GGTCGTCTCC GGGTGGACCA CCAGTACCGA GCTCCTCATC
GAACTGGGCC TGACGGACCG GATTGTTGGC CAGTACAACA CCAGCAGCGG CACACCGGCG
GCGAAGTACG CCAGTGTGGA GGCGAAGCTA CCCGCCCTCG GTACCGGCGC ACCGACCCGG
GAGGCGCTGC TCGCCGCCCG GCCCGACCTG ATCTGGGCGG ATGGCAGCTA TCTGTTTGAC
GGCCGGCAGC TCCCCACGAT CGCCGAGCTG GCCGCCCAGG GCACCCAGGT GATGATTCTC
AGCGGGTTCT GCACCGACGA CGCCACCAAG GCGACGGTCC GTGACGTCGA TACCGACCTG
ACCGCGCTCG GCATGATCTT CGGCATTCCG GGCCGGGCCA GGCAGGTCCA GGCCGACATC
AACGAACGGC TCCGCCAGGT CGCGACGAAG ATCCAGGGCA GGGACCCCGT CCCGGTCGCG
GTGGTCGCCA CCTACCAGGG AACCGTCTAC ACCTACGACG GCGTCTACAC CGACATCGCG
CGGCTCGCCG GGGCGCGGAA CATCTACGCG GGCACCCTGC CCAAGGGCAA ATACTTCAGT
GAGCTGTCGG TGGAGGACCT CATCAGCAAG AATCCCGGCA CCCTCGTCTA TCTGCTCAGC
GGCAGCGAAA CCGAGGCCGA GGCCCGCAAG TTCCTCACCT CCCGGCTCCC GACCGTGGCG
GCAGTCCGCA ACAACCGTGT GTTCTTCCTC CCGCAGTCGG ACTCGGCCAA CCTCGCCGGC
GTGGAAGGCG TGACAAAGCT GGCCGCCGCG CTGCACGCCT GA
 
Protein sequence
MKKLGAGVAV AALLVVTAGL TGCGSGSTTG AGSASAADAR SAPAGTRYPV EVANCGRTLH 
FDEAPARVVS GWTTSTELLI ELGLTDRIVG QYNTSSGTPA AKYASVEAKL PALGTGAPTR
EALLAARPDL IWADGSYLFD GRQLPTIAEL AAQGTQVMIL SGFCTDDATK ATVRDVDTDL
TALGMIFGIP GRARQVQADI NERLRQVATK IQGRDPVPVA VVATYQGTVY TYDGVYTDIA
RLAGARNIYA GTLPKGKYFS ELSVEDLISK NPGTLVYLLS GSETEAEARK FLTSRLPTVA
AVRNNRVFFL PQSDSANLAG VEGVTKLAAA LHA