Gene Francci3_3571 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3571 
Symbol 
ID3904510 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4268635 
End bp4269708 
Gene Length1074 bp 
Protein Length357 aa 
Translation table11 
GC content66% 
IMG OID637880892 
ProductABC transporter, substrate-binding protein, aliphatic sulphonates 
Protein accessionYP_482652 
Protein GI86742252 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID[TIGR01728] ABC transporter, substrate-binding protein, aliphatic sulfonates family 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGCCGA GCAGACTGTG GCGTGCAGCG ACGCCACTCA TCGCCGCGGG GGCGCTTGTA 
CTGGCGGTGG TGGGCTGCTC GAGTGACGAC GGGGACTCCC AGACGGTCGC CTCGAACGGC
CAGATCACCG GCACCCTACG GCTCGGGTAC TTCCCGAACC TCACCCACGC GCCAGCTCTC
TACGGCGCCG AGAAGGGGAT CTTCGCCAAG GACCTCGGCT CTGGAGTCAC GCTCAAGACA
TCGACCTTCA ACTCCGGCGT GCAGGAGGCG GAGGCGATTC TGTCCGGCGC CATCGATGCC
GGTTATATCG GTCCCAACCC GGCGGTCAAC AGCTTCATCA AGTCGCACGG GGAGGCGGTG
CGGATCGTCT CCGGCGCTAC CTCGGGTGGT GCGTCGCTGG TGGTCAAGCC TGAGATCACC
TCCGTCGCGC AGCTCAAGGG CACCACGCTC GCCACCCCGA GCCTGGGCAA CACCCAGGAC
GTGGCGCTGC GGTACTTCCT CAAGAAGAAC GGGCTGAAGA CCGACACCCA GGGTGGGGGT
GATGTCTCCA TCAAGCCCCA GGACAACACC GTGACGGTGG ACGCCTTCGC CAACAGGGCC
ATCGACGGGG CCTGGGTGCC CGAGCCGACG GCCTCCCGGC TCGTCGCAGC GGGGGGCAAG
GTGCTCGTCG ACGAGGCCGA CGAGTGGCCG GAGACCAAGG GCCAGTTCGT CACGACGGTG
CTTCTGGTGA GCACGGACTA TCTGAAGAAG AACCCGGAGA TCGTCCGCCG GCTCATCACG
GCGAATGTGG AATCGATCAA CGCGCTCAAC GCCGACCGTG ACGCCGGCGC GAAGGTCACC
AACACGGCGC TGGGCAAGCT GAGCGGCAAG CCGCTGTCCG ACAAGATCCT CACCTTGGCC
TGGAAGGGGC TGACCTTCAC TCCGGACCCG ATCGCGGCGT CCCTGTTCAC CTCGGCAAAG
CACCAGGAGG AGCTCGGCCT CATCAAGAAC CCGAAGCTCG ACGGTCTGTT CGATCTGACG
GTCCTCAACG AGATCCTCGC CAAGCAGGGC AAACCGACGG TCGCCGACTC CTGA
 
Protein sequence
MRPSRLWRAA TPLIAAGALV LAVVGCSSDD GDSQTVASNG QITGTLRLGY FPNLTHAPAL 
YGAEKGIFAK DLGSGVTLKT STFNSGVQEA EAILSGAIDA GYIGPNPAVN SFIKSHGEAV
RIVSGATSGG ASLVVKPEIT SVAQLKGTTL ATPSLGNTQD VALRYFLKKN GLKTDTQGGG
DVSIKPQDNT VTVDAFANRA IDGAWVPEPT ASRLVAAGGK VLVDEADEWP ETKGQFVTTV
LLVSTDYLKK NPEIVRRLIT ANVESINALN ADRDAGAKVT NTALGKLSGK PLSDKILTLA
WKGLTFTPDP IAASLFTSAK HQEELGLIKN PKLDGLFDLT VLNEILAKQG KPTVADS