Gene Francci3_2418 details

Gene Information

Locus tag: Francci3_2418
Symbol:
ID: 3906401
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 2808228
End bp: 2809418
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 70%
IMG OID: 637879748
Product: binding-protein-dependent transport systems inner membrane component
Protein accession: YP_481514
Protein GI: 86741114
COG category: [E] Amino acid transport and metabolism; [P] Inorganic ion transport and metabolism
COG ID: [COG0601] ABC-type dipeptide/oligopeptide/nickel transport systems, permease components
TIGRFAM ID:
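
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2808228
END_BP = 2809418
GENE_LENGTH_BP = 1191
PROTEIN_LENGTH_AA = 396


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length_aa(gene_length_bp: int) -> int:
    """Residue count of a CDS, assuming exactly one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks agree with the values listed in the record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)           # True
print(coding_length_aa(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)     # True
```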


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.274162
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.302095
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTACCC CGACTACCCC GACTACCCCG CACCGGCACC GAACGACGCG AAGGCTGGCC 
GCAGCGGCCG TCTCCGTCCT CACGACCGTC ACGCTCGCGG CCTGCGGGAA CGGCGGCGGC
AGCACGGCGT CGGGCGGGAC CCCCGTCCCC GGCGGGCGGC TGAAGATCGC CCTCTGGTCG
GACCTCCAGA ACTGCGTCGA CCCCAACCAG GTGTACTGGA TAGAGACGCG CAGCCTCGAC
CGGAACATCG CCGACTCACT GACCGATCAG GATCCCAGCA CCGGGAAGAT CGTTCCCTGG
CTGGCCAACA GCTGGACGGT GAGCCCCGAC GCCAGCGAGT ACACGTTCTC GCTGCGTAAA
GGTGTGACCT TCAGCGACGG CACCACGCTC GACGCGGCTG CGGTGAAGAC GGCCTTTGAC
GGGTTGCACG CGCTCGGCGC CAAGTCGACG CTGGGCCTGA GCTACCTGGC GAACTACAAG
GCGACGACGG TCGTCGACCC CACCACGGTG AAGGTCGCCT TCAAGGGCGC CGCCCACCTC
GACGGCATCG ACGTCACCTA CATCGCCGAG GACAGCGTCC GCGTCGGCCT TGACGATCCC
GTGCCCGGGC AGTACTTTCG CCCCCTCTGG GACGCGCTGC ACCTCGACTT CGGCCTCTCG
CTCACCCAGA ACACGACCGT CGGCGCGCTG CTCGCCGACA AGCTGCCCAC GACCCTCGCC
CTCGCGGCGG TCGCCGTCGT GCTCATGGCG GTGCTCGGCG TCACGGTCGC CTACCTCGCC
AGCTACCTCC AGTGGGGGCC GGCGCGGGCG CTGCTGCCCC GGCTGCCCGC CGTCGCCATC
TCGCTGCCGC CCTTCTTCGT GGGGCTGCTG CTCATCCAGA TCTTCGCCTT CTCCCTCGGC
TGGTTCCCGG CCACGGGCAC GGACGGGTGG CGCAGCCTGG TGCTGCCCGC GCTGACCCTG
TTCGGCGTCC TCGTCGGCCA CATGGTGGCG AACGCCGTGG TCGTCGAGAC CGTCTTCTCC
CGCAGCGGCA TCGGGCTCCT GGCCCAGCAG TCGGTCCTCA GCCAGGACGT GCCTGTCGTG
CAGGCGATCG TCCTGATCGC CGCGGCCCTG TTCGTCACCG TCAACCTGCT CGTCGACCTG
CTCTACTCGC TTCTCGACCC ACGCGTCGCG TGGACGCCCC GGGTCAACTA G
 
Protein sequence
MTTPTTPTTP HRHRTTRRLA AAAVSVLTTV TLAACGNGGG STASGGTPVP GGRLKIALWS 
DLQNCVDPNQ VYWIETRSLD RNIADSLTDQ DPSTGKIVPW LANSWTVSPD ASEYTFSLRK
GVTFSDGTTL DAAAVKTAFD GLHALGAKST LGLSYLANYK ATTVVDPTTV KVAFKGAAHL
DGIDVTYIAE DSVRVGLDDP VPGQYFRPLW DALHLDFGLS LTQNTTVGAL LADKLPTTLA
LAAVAVVLMA VLGVTVAYLA SYLQWGPARA LLPRLPAVAI SLPPFFVGLL LIQIFAFSLG
WFPATGTDGW RSLVLPALTL FGVLVGHMVA NAVVVETVFS RSGIGLLAQQ SVLSQDVPVV
QAIVLIAAAL FVTVNLLVDL LYSLLDPRVA WTPRVN
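
The gene and protein sequences above are related by translation table 11, and the GC content field is a property of the gene sequence. A minimal sketch of both calculations follows. It assumes the sequences are pasted as printed here (10-letter groups separated by spaces), and that table 11 uses the same amino acid assignments as the standard code, differing only in permitted start codons. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (third base varies fastest).
# Assumption: table 11 shares these assignments with the standard code.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the printed layout."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    seq = clean(dna)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence, as printed in this record.
first_30 = "ATGACTACCC CGACTACCCC GACTACCCCG"
print(translate(first_30))  # MTTPTTPTTP
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed above, and `gc_fraction` on the same input can be compared with the reported GC content.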